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' Abstract. We give sharp, uniform estimates for the probabiHty that the empirical distri- 

, bution function for n uniform- [0, 1] random variables stays to one side of a given line. 

^ ' 1. Introduction 



Let Ui, . . . ,Un be independent, uniformly distributed random variables in [0, 1] and let 
u > 0, V > 0. Our goal is to estimate 

Fn{t) < (0 < t < r 

n 

\ where Fn{t) = ^X]t/<tl associated empirical distribution function. In 1939, N. V. 

(N " ' 

(N 

Q^ I A > the asymptotic formula 

§: (1.1) P(D+ < A) =g„(AV^,n) ^ l-e-2^' (n^oo). 

^ ■ When Ao < A = 0{n^^^) with fixed Aq > 0, sharper forms of (11.11) have been proven by a 
. number of people (e.g. [1(|; see also Ch. 9 of [13]), in particular 

(1.2) P{D-. < A) = 1 - e-^^ (l-^ + oC''^'^ 



3^1/2 



n 



I Here and throughout the Landau O— symbol has its usual meaning: /(■) = 0{g{-)) means 
1^ ■ \f\<cgioT some constant c, which is independent of the inputs to the function /. Also, 

f ^ g means / = 0{g) and f g means / = 0{g) and g = 0{f). 

One may ask about the behavior of Qn{u,v) for a wider range of the variables u,v. The 
strong Komlos-Major-Tusnady theorem |9!] implies 

\F^{t) - t - n-''^B^{t)\ « ^ (0 < t < 1) 

n 

with probability > 1 — 0{l/n), where B^it) is a Brownian bridge process. The order 
on the right side is also best possible [§| (see also Ch. 4 of [H). Since 



( sup {Bn{t) - (at + 6)) < ) = 1 - e-2^('^+''), 

Vo<i<l / 



and writing 

w = u + V — 
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the KMT theorem imphes the uniform estimate 



:i.3) 



Qn{u,v)=0{ -1+1 

= 1 - e-^""-/" + O 



2(M + O(logn))(tu+O(logn)) 



w 



log n) log n 



n 



This gives an asymptotic for Qn{u, v) provided oo, oo, and u+w = o{n/ logn) 

as n ^ oo. In the author's recent paper [6| on the distribution of divisors of integers, sharper 
information was needed for very small u and w. That paper includes a short proof of the 
crude bound Qn{u, v) -C ("+^)(^"'+^) uniformly in n > 1, m > 0, and w >0. 

By using different methods, we prove here new uniform estimates, which essentially remove 
the logarithm terms from the right side of (11.31) . 

Theorem 1. Uniformly in u > 0, w > and n > 1, we have 

'u + w 



Qn{u,v) = l-e-2""'/" + 



n 



In particular, if u 



oo and u + w - 
Qn{u,v) 



o(n) as n ^ oo, then 



1 



2. A RANDOM WALK WITH A BARRIER 

Exact formulas for Qn{u,v) are known, which we record below. 

Lemma 2.1. Assume n > 1 and v > 0. Then 

(i) Ifn-v<u<l, thenQr^iu,v) = ^{l + u/v)''-\ 



(ii) Ifn — v<u<n and u > 0, then 
Qniu,v 



1-^ y 

n / J 



V' 



u<j<n 



[V + U — j 



U 



Formula (i) is due to H. E. Daniels [2j] and (ii) is due to R. Pyke pjj. The case f = n in 
(ii) was earlier proved by Smirnov ij] . Starting with (ii), one may use a more complicated 
version of the complex analytic method of Lauwerier llO| to prove Theorem [H This was 
carried out in an early version of the author's paper |6|, a sketch of which may be found 
m (the English paper p\ includes a sketch of the argument below). We present below 
an elementary, probabilistic proof of Theorem [H Rather than work with (ii), we reinterpret 
Qn{u, v) in terms of a random walk. 

Lemma 2.2. Let Xi, . . . ,Xn+i be independent random variables, each with density function 
e^~^ if X < 1 and if x > 1. Put So = and Sj = Xi + ■ ■ ■ + Xj for j > 1. Then 



Qn{u,v) 



max Sj < u 

0<j<n 



Sn 



+1 



n + 1 — V 



Proof. Let Yi, ■ ■ ■ , l^„+i be independent random variables with exponential distribution, and 
let Wk = Fi + ■ ■ ■ + Yfc for 1 < k < n + 1. Let ^i, . . . , ^„ be the order statistics of f/i, ...,[/„, so 
that Qn{u, v) is the probability that > for every j. By a well-known theorem of Renyi 
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[12| |. the vectors (^i, . . . , and (Wi/Wn+i, ■ ■ ■ , Wn/Wn+i) have identical distributions. Sim- 
ilarly, given that Wn+i = v, the probability density function of the vector {Wi/v, . . . , Wn/v) 
is identically n\ on the set {(xi, . . . , x„) : < Xi < • • ■ < x„ < 1}. Therefore, 



Qn{u,v) 



min {Wi — i) > —u 

l<i<n 



Putting Xi = 1 — Yi completes the proof. 



□ 



The sequence 0, Si, S2, ■ ■ ■ can be thought of as a recurrent random walk on the real line, 
with Qn{u,v) being the probability that the walk does not cross a barrier at the point u 
given that it ends at the point n + 1 — v after n + 1 steps. A similar quantity may be 
defined for a random walk with the Xj having a different distribution. In the paper an 
analog of Theorem [T] is proven for a general walk whose steps Xj have a continuous or lattice 
distribution, but valid in a more limited range of the variables. More specifically, under 
appropriate conditions on Xj, we prove that 



max Sj < y 

0<j<n~l 



Sn = y - z 



uniformly for < ?/ < c-/n, < z < Cy/n (c being any fixed constant). 

Kolmogorov used a relation similar to that in Lemma [2.21 in his seminal 1933 paper [sl on 
the distribution of the statistic 

Dn = Vn sup \Fn{t) - t\. 

0<t<l 

Specifically, let Xi, X2, . . . , X„ be independent random variables with discrete distribution 



P[X,=r-l] 



T: 



(r = 0,l,2,...) 



and let Sj = Xi + ■ ■ ■ + Xj for j > 1. Like the variables Xj in Lemma 12. 2[ each Xj has mean 
and variance 1. Kolmogorov proved that for integers u > 1, 



P( sup \Fn{t) -t\< u/n) 

0<t<l 



nle 



max \Sj\ < u, Sn 

0<j<n-l 







max \Sj\ <u 



5„ = 



0<j<n-l 

Small modifications to the proof yield, for integers u>l and for n > 2, that 



Qn{u,n) 



max 5*0 < u 

0<j<n-l 



Sn = 



When V ^ n, however, it does not seem feasible to express Qn{u, v) in terms of the variables 



S,. 



Let /„ be the density function for Sn {n = 1,2,...). The Central Limit Theorem for 
densities (e.g.. Theorem 1 in §46 of [3]) implies that for large n and |x| -C y/n, fn{x) ~ 
(27m)~-^/^e~'^^/^". However, there are asymmetries in the distribution for |a;| > ^/n. We have 



(ra-x)" 



(2.1) Ux) = 

which is easily proved by induction on n. 



'^(n-l)! 



X < n 
X > n 
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Lemma 2.3. Let n >2. Then 

(i) fn{x) is unimodular in x, with a maximum value fn{^), OLnd fn{^) 



/2-Kn' 



(ii) Forx>Q, + < 

(iii) For each real z > Q, there is a unique number b = b{n, z) satisfying < b < z and 
f^{l-z)=Ul + z-b). 



Proof. Item (i) follows from 
(2.2) 



1 — X 



n — X 



fn{x) {x < n) 



and Stirling's formula. For (ii), suppose < x < n — 1. Then 



fn{l + x) 



1 - 



X 



1 + 



X 



exp 



-2E 



X 



2i+l 



-(2j + l)(n-l)2^- 



/n(l -x) V n-\) \ ' n-\ 

Item (iii) follows immediately from (i) and (ii). 

Using properties of 6(n, z), we will prove a sharper form of Theorem [H 
Theorem 2. Suppose n>l,l<u<Y^,l<w<j^ and let b = b{n + l,w). Then 

u{2w — b) 



< 1. 



□ 



Qn{u,v) 



{n — w + b){n + w — u) 



O 



u + w uw 

— |— I g n-\-u! — u 



n 



3. A RECURRENCE FORMULA 

Our principal tool for estimating Qn{u, v) is a recurrence formula based on the reflection 
principle for random walks : For y > and y > x, a recurrent random walk of n steps that 
crosses the point y and ends at the point x is about as likely as a random walk which ends 
at 2y — X after n steps. For convenience, define 



Rn{x,y) = /„(x)P 



max Sj < y 

0<j<n~l 



Sn — X 



D 



max Sj < y, Sn = x 

0<j<n-l 



where the last expression stands for the density function ^P[T„_i < y, Sn < x]. From the 
reflection principle we expect that Rn{x,y) ~ fn{x) — fn{2y — x). The next lemma gives a 
precise measure of the accuracy of the reflection principle for our specific random walk. 

Lemma 3.1. For a positive integer n >2, real y > 0, real x, and real a > 1, 

„1 n-l 

(3.1) Rn{x,y) = fn{x)- fn{y + a)+ / y2Rk{y + ^,y){fn-k{a-^)-fn-k{x-y-^))d^. 

-^0 fe=l 

Proof. Define Tj = max(5'o, . . . , Sj). Start with 

Rn{x,y) = fn{x) - fn{y + o) + /„(?/ + o) - B[Tn-l > y, Sn = x]. 
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If Sn = y + a, then there is a unique k, 1 < k < n — 1, so that Tfc„i < y and Sk > y. Thus, 



n-l 



fn{y + a) = ^ D[Tfe_i < S'fc > ?/, 5^ = y + a] 
fc=i 

n-l „i 

= / D[rfc_i < S'fc = ?/ + ^, S'n = ?/ + a] c?^ 
fc=l -^0 

n-l „i 



Similarly, 



D[T„_i > 5'„ = x] = D[Tfe_i <y,Sk> y, Sn = x] 

k=i 

n-l „i 

k=l 



□ 



In Lemma Em choosing a ^ y — x — b{n, y — x) should make |/n_jt(a — ^) — — y — ^)| 

small for small k. Also, we expect Rk{y + to be small, especially for large fc, so the 
integral-sum on the right of (13. ip will be treated as an error term. 

The same argument provides an analogous formula when the steps in the random walk 
have an arbitrary distribution (see ^). 

We next give a crude estimate for Rn{x,y) when x > y which will be used on the right 
side of flO) . 

Lemma 3.2. Ifk>l,y>0, andO < jj <1, then Rk{y + ^i,y) ^fk{y)- 

Proof. Without loss of generality, suppose k > 10 and < y < ^. By Lemma 1231 (ii), when 
1 < J < ^ - 1, /i(4 - < fjif^ - 0- By Lemma O (with a = 4 and x = ?/ + /i) and CT . 

t - K iy + ^)h{y) 



Rk{y + i^,y)< fkiy + /x) - fkiy + 4) = / j—Jkit) dt <^ 

Jy+fj, 'f' 



4. Estimates for /„(x) 

Lemma 4.1. We have 

(i) Ifn > 20 an(/ < z < f^, t/ien 6(n, 2) < f and b{n, z) = + 

(ii) Ifn > 1 ant/ |a;| < f , i/ien ri-VSg-^V" ^ /^(j;) ^ ^-i/2g-xV3n . 

(iii) Ifl<h<H < 10x2, t/ien fh{x)h-^ < fH{x)H~^; 

(iv) Ifl<k<n, then fk{x) < (n/A;)i/2/„(a;) . 

Proof. First, writing 6 = 6(n, 2;), we have 

n — 1 + z 



□ 
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Under the hypotheses of (i), let t = , so that < t < | by Lemma [23] (hi). Then 



n — 1 



which imphes 



t = 2 



n — 1 



iog(i -t)+t 

t 2 3 



O 



3\n-l 



n — 1 



The asymptotic for b fohows. Since | + y + -- -<|t, & <| and this proves (i). 
Item (ii) is trivial when n < 100. When n > 100, (12.11) and Stirling's formula give 



/„(x)xn-i/V-Ml- 



a; — 1 
n — 1 



n— 1 



n ^^"^ exp 



(x-1) 



9 °° -1 / 1 \ m—2 



n 



— 1 ^-^ m\n — 1 



m=2 



Since l^ryl — 0-35, the sum on m is between | and |, which proves (ii). 



Since fn{x)n 



n 



for n > (\r-)^, it suffices to prove (iii) when H < (\r-)^. For 



l<h< and h>x, 



h{x)h-' X g{h) := h-'/'e^-' 1 



We have 



X — 1 

h-1 

X — 1 



h-1 



h — X 



> 0, 



and (iii) follows. If |x| < y/n, Lemma 12731 (i) and part (ii) above imply fk{x) ^ k and 
fn{x) ^ n~^^'^. When |x| > ^/n, applying (iii) gives fkix) <^ fn{x) <^ {n/ky/'^ fn{x), proving 
(iv). 



□ 



(4.1) 



A useful corollary of Lemma 13.21 and Lemma 14.11 (iv) is 

n^''^ufn+i{u) 



Rk{u + i,u) < 



(1 < A; < n + l,u > 1,0 < ^ < 1). 



Lemma 4.2. Suppose n > 100, I < w < f^, b = b{n + l,w) and < ^ < 1. 

(i) Ifw^/'^ < h <n, then 

\f,{l + vj-b-0-fh{l-w-0\<^ + 

(ii) If2y/n < w < and 1 < k < n — 3w, then fn+i-ki^+w — b—^) and fn+i-k{^—w—C,) 
are each 

^.«(.-.)e.[E(l(.-^).0g)).0(^ 

Proof. Assume w^^'^ < h < n and write 
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where, by Lemma [4.11 (i). 

/ h-l 1 {h-l)(2w-h) ^(w^ 



/i-i + u; + CV 2 V /i-i + w + ^yy v/i2 

By hypothesis, <^ 1 and hence 

<|i?|A(l-«;)«(^^ + ^) /,(!-«;). 

This proves (i). 

To prove (ii), we write 



e 



j=ri— A; 



say. By (12. ip and the hypothesis on w, A <^ ^ and 

i?, = l + (j-l)logfl-— -l^^Vlogfl+ "^"^"^ 



i — w + h + J V + ^ + ^ 



j-'u; + 6 + eV 2(j-«; + 6 + 2{3-w + h + i) \j j 
1/1 lU^^ „ f 



2 

Arguing similarly, 

/n+i-fc(l - 10 - ^ ^c+Z)„_fc+-+D„^ 
fn+l{l-w) 

where C < H and = 2^(1 - ^) + ^(wVj^). Combining (g^]), (US]), the above estimates 
for A, Bj, C and Dj, and the relation /n+i(l — li?) = /n+i(l + w — concludes the proof of 
(ii). □ 

5. PROOF Theorem [2] 

Without loss of generality, suppose n > uq, where Uq is a large absolute constant. We 
apply Lemma [XT] with a = 1 + w — b, where b = b{n + 1, w), obtaining 

n 

(5.1) Rn+l{n + 1 - f ,m) = fn+l{u + 1 - W) - /n+l(M + 1 + W - &) + ^ Afc, 

fc=l 

where 

|Afc| < max i?fc(M + ^,m)|/„+i_a:(1 + w - & - " /n+i-fe(l - - 0|- 
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If n < M^, then fk{u)/k ^ fn+i{u)/n for 1 < A; < n by Lemma [4.11 (iii). If n > m^, then we 
have 

fkju) ju~^fi^2^{u) X if A; < 

k if k>U^ 

by Lemma [4.11 (ii). (iii). In both cases, 

(5.2) ^AM«(i + !!!^!)/„^.(„,), 

k=l ^ ^ 

Suppose that I < w < 2y/n, so that b = 0(1). We will prove that 

(5.3) ^|A,|«^i^/„+i(«)«^^+i(t.)/„+i(l-^) (l<^<2v^). 



k=l 



The second inequality follows from the first and Lemma Hm (ii). Let h = n+l—k, ho = [w^/^J 
and hi = [w'^/10\. Choose no > 2^° so that ho < n/2. For 1 < h < ho, (14. ip and Lemma 
O (ii) give 

An+i-h < z B,^?, + w -b-^) + fh{l - w - 0) < — 171 — • 



n o<5<i nty 
If ho < h < hi, then (14. ip . Lemma [4.21 (i) and Lemma [4.11 (ii),(iv) imply 

^"^^-^ U + 7^J "^^ U + 7^J ^'^^^^ "^^ 

When hi < h <^, fjiTT]) and Lemma H^l (i) imply 

M/n+l(M) W Mw/„+i(m) 
An+i_h < T^'^'^l - W) < :— . 

n h nw^i^ 

Summing on /i < ^ we obtain 

(5.4) Y. « 

Lemma 13.21 Lemma 14.21 (i) and (15. 2p imply 

EUW fk{u) U + W ^ , , 

n/2<h<n l<fc<n/2+l 

Combined with f l5.4p . this proves (15. 3p . 
Next, suppose 2A/n < w < and set 

(in 3 

K 



TV 

min ( n — CoW, — 



where Co is a large absolute constant. When 1 < k < K , apply Lemma [3.21 and Lemma [4.21 

4(i-«^Vj)<-i^<-S 



(ii), observing that for each j < n, ^(1 — j f) < — 14 < — If A; < n/2, then 



(5.5) A. « .Mf) (!£ + i!£!) e--'/(io..,;^^^(i _ 

K V n n'^ / 
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When n/2<k< K, 

M/fc(M)/n+l(l - W) 



-e 6{n-fc) gxp < C 



[n — kY n 



1 



for an absolute constant Ci. If in addition n — k > w^/^, then 



e 6("-fc) j exp < Ci 



w w 
+ - 



1 « 



(n — kY n 



(n — ky n 

which imphes (15. 5p . If Cqw < n — k < w^^'^ and we take Cq = 20Ci, then 

Q-6(7rrk) I gxp <^ Ci 



H e 61^^, 



w 



w 
+ - 



{n — kY n 



1 1 < e 12(n-fc) < n lOn 



and (15.51) follows in this well. 
By Lemma EKiv), (Q and (l53|) . 



(5.6) 



J2 l^fcl <^i/n+l(l 



— W 



k<K 



w sr^ fk{u) , w^fn+i{u) 



72 — ' k 

k<K 



+ 



5/2 



fc=l 



l/2g-fcu;2/(10n2) 



■C /n+l(l - W;)/n+l(M) 



+ 



When k > K, we combine Lemma [4.11 (i), (iii) and Lemma [4.21 (ii) to obtain 

/„+l_fe(l + W -b-^) + /„+i_fc(l - W - < fn+l-K{l + W -b-^) + /„+i_x(l -W-^) 

Together with (14.11) . this gives 



i(l — t/7)e 



K<fc<n 

If 2^/n < w < n^/'^, then i^T = [n — 3w\ and 



K<k<n 



K<k<n 

If n^/^ < tt' < ffT, then K > n? /2w^ and 



^3/2 ^3/2 



-Kw'^/{Wv?) 



K<k<n 



Therefore, 



|Afc| < /„+i(l -«;)/„+! 

K<k<n 

Combined with (15.61) . we have 

" / 2 \ 

(5.7) I^^l « - 1^2+^2) (2v^ 



< < n/10). 
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Combining (15. ip . (15.31) and (15.71) with Lemma [2.21 in all cases we have 



n ( ^ 1 fn+i{u + l + w - h) 

Qn[u,v) = 1 ; — ^ h O 



fn+l{u +1-W) 

By the definition of 6, 

fn+l{u + l + W - h) 
fn+l{u + 1-W) 



Also, by Stirling's formula, 

n^/Vn+l(l -W)fn+liu) 



fn+iiu + l-w) I n 

_ fn+l{u +1 + W - &)/n+l(l - W) 
+ W - h)fn+l{u +1-W) 

_ / u{2w-h) y 

\ {n — w + h){n + w — u) ) 
n^/'^in +IY ({n + l-u){n + w)Y 



u + w uw 

+ 



fn+l{u + 1-W) 



which concludes the proof of Theorem [21 



[n + l){n + w — u) 



« 1 



u{w — 1) 



{n + l){n + w — u) 



<^ g n-\-w — u 



□ 



6. PROOF Theorem [T] 



We may assume < u < Sn and < w < 6n for a small, fixed, positive 6. If < m < 1 
and < w < 6n, Lemma ET] (i) implies Qn{u, v) <^ w/n. When < w < 1 and 1 < m < 6n, 
Lemma [32] implies Qn{u, v) -C ^. When 1 < u < 6n and I < w < 6n, we may assume that 
n is large. The error term in Theorem [2] is 



U + W W uw uw u + w 

< \ e"^ < . 

n n n n 



When uw > n^/^, the main terms are 



1 -0(e-2" 



n 



When uw < n*^^, the main terms are, by Lemma H?T] (i) 

u{2w — h)n 



1 — exp 
1 — exp 



{n — w + h){n + w — u) 



2uw 



n 



1 + 



u + w 



n 



-O 
+ 



uw] 



[uw)" 



2uw 

1 - e ~ + O 



2uzu 

1 - e~~ + O 



u + w + {uwY^"^ 



n 



u + w 



n 



□ 
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